Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 8970 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.3 MiB |
| Average record size in memory | 156.0 B |
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 15 |
Country has constant value "United States" | Constant |
OrderID has a high cardinality: 4711 distinct values | High cardinality |
OrderDate has a high cardinality: 1228 distinct values | High cardinality |
ShipDate has a high cardinality: 1322 distinct values | High cardinality |
CustomerID has a high cardinality: 792 distinct values | High cardinality |
CustomerName has a high cardinality: 792 distinct values | High cardinality |
City has a high cardinality: 519 distinct values | High cardinality |
ProductID has a high cardinality: 1847 distinct values | High cardinality |
ProductName has a high cardinality: 1835 distinct values | High cardinality |
df_index is highly correlated with RowID | High correlation |
RowID is highly correlated with df_index | High correlation |
PostalCode is highly correlated with State and 1 other fields | High correlation |
Sales is highly correlated with Profit | High correlation |
Quantity is highly correlated with Country | High correlation |
Discount is highly correlated with State and 1 other fields | High correlation |
Profit is highly correlated with Sales | High correlation |
ShipMode is highly correlated with Country | High correlation |
Segment is highly correlated with Country | High correlation |
Country is highly correlated with Segment and 5 other fields | High correlation |
State is highly correlated with PostalCode and 2 other fields | High correlation |
Region is highly correlated with State and 1 other fields | High correlation |
Category is highly correlated with SubCategory | High correlation |
SubCategory is highly correlated with Category and 1 other fields | High correlation |
df_index is uniformly distributed | Uniform |
RowID is uniformly distributed | Uniform |
OrderID is uniformly distributed | Uniform |
df_index has unique values | Unique |
RowID has unique values | Unique |
Discount has 4712 (52.5%) zeros | Zeros |
Reproduction
| Analysis started | 2022-10-31 19:04:30.875496 |
|---|---|
| Analysis finished | 2022-10-31 19:05:56.714239 |
| Duration | 1 minute and 25.84 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 8970 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4982.67068 |
| Minimum | 0 |
|---|---|
| Maximum | 9993 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 70.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 508.45 |
| Q1 | 2486.25 |
| median | 4993.5 |
| Q3 | 7451.5 |
| 95-th percentile | 9493.55 |
| Maximum | 9993 |
| Range | 9993 |
| Interquartile range (IQR) | 4965.25 |
Descriptive statistics
| Standard deviation | 2875.094184 |
|---|---|
| Coefficient of variation (CV) | 0.5770187051 |
| Kurtosis | -1.189201873 |
| Mean | 4982.67068 |
| Median Absolute Deviation (MAD) | 2482 |
| Skewness | 0.008813774477 |
| Sum | 44694556 |
| Variance | 8266166.565 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 6616 | 1 | < 0.1% |
| 6610 | 1 | < 0.1% |
| 6611 | 1 | < 0.1% |
| 6612 | 1 | < 0.1% |
| 6613 | 1 | < 0.1% |
| 6614 | 1 | < 0.1% |
| 6615 | 1 | < 0.1% |
| 6617 | 1 | < 0.1% |
| 6608 | 1 | < 0.1% |
| Other values (8960) | 8960 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 9993 | 1 | |
| 9992 | 1 | |
| 9991 | 1 | |
| 9990 | 1 | |
| 9989 | 1 | |
| 9988 | 1 | |
| 9987 | 1 | |
| 9986 | 1 | |
| 9985 | 1 | |
| 9983 | 1 |
| Distinct | 8970 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4983.67068 |
| Minimum | 1 |
|---|---|
| Maximum | 9994 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 509.45 |
| Q1 | 2487.25 |
| median | 4994.5 |
| Q3 | 7452.5 |
| 95-th percentile | 9494.55 |
| Maximum | 9994 |
| Range | 9993 |
| Interquartile range (IQR) | 4965.25 |
Descriptive statistics
| Standard deviation | 2875.094184 |
|---|---|
| Coefficient of variation (CV) | 0.5769029232 |
| Kurtosis | -1.189201873 |
| Mean | 4983.67068 |
| Median Absolute Deviation (MAD) | 2482 |
| Skewness | 0.008813774477 |
| Sum | 44703526 |
| Variance | 8266166.565 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 6617 | 1 | < 0.1% |
| 6611 | 1 | < 0.1% |
| 6612 | 1 | < 0.1% |
| 6613 | 1 | < 0.1% |
| 6614 | 1 | < 0.1% |
| 6615 | 1 | < 0.1% |
| 6616 | 1 | < 0.1% |
| 6618 | 1 | < 0.1% |
| 6609 | 1 | < 0.1% |
| Other values (8960) | 8960 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 9994 | 1 | |
| 9993 | 1 | |
| 9992 | 1 | |
| 9991 | 1 | |
| 9990 | 1 | |
| 9989 | 1 | |
| 9988 | 1 | |
| 9987 | 1 | |
| 9986 | 1 | |
| 9984 | 1 |
| Distinct | 4711 |
|---|---|
| Distinct (%) | 52.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| CA-2017-100111 | 13 |
|---|---|
| CA-2017-157987 | 12 |
| CA-2016-165330 | 11 |
| US-2016-108504 | 11 |
| US-2015-126977 | 10 |
| Other values (4706) |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Characters and Unicode
| Total characters | 125580 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2496 ? |
|---|---|
| Unique (%) | 27.8% |
Sample
| 1st row | CA-2016-152156 |
|---|---|
| 2nd row | CA-2016-152156 |
| 3rd row | CA-2016-138688 |
| 4th row | US-2015-108966 |
| 5th row | US-2015-108966 |
Common Values
| Value | Count | Frequency (%) |
| CA-2017-100111 | 13 | 0.1% |
| CA-2017-157987 | 12 | 0.1% |
| CA-2016-165330 | 11 | 0.1% |
| US-2016-108504 | 11 | 0.1% |
| US-2015-126977 | 10 | 0.1% |
| CA-2015-131338 | 10 | 0.1% |
| CA-2016-105732 | 9 | 0.1% |
| CA-2015-132626 | 9 | 0.1% |
| CA-2017-140949 | 9 | 0.1% |
| CA-2015-158421 | 9 | 0.1% |
| Other values (4701) | 8867 |
Length
| Value | Count | Frequency (%) |
| ca-2017-100111 | 13 | 0.1% |
| ca-2017-157987 | 12 | 0.1% |
| ca-2016-165330 | 11 | 0.1% |
| us-2016-108504 | 11 | 0.1% |
| us-2015-126977 | 10 | 0.1% |
| ca-2015-131338 | 10 | 0.1% |
| ca-2015-158421 | 9 | 0.1% |
| ca-2015-164882 | 9 | 0.1% |
| ca-2017-140949 | 9 | 0.1% |
| ca-2015-132626 | 9 | 0.1% |
| Other values (4701) | 8867 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 22887 | |
| - | 17940 | |
| 0 | 13890 | |
| 2 | 13777 | |
| C | 7556 | 6.0% |
| A | 7556 | 6.0% |
| 6 | 7081 | 5.6% |
| 7 | 6682 | 5.3% |
| 4 | 6635 | 5.3% |
| 5 | 6604 | 5.3% |
| Other values (5) | 14972 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 89700 | |
| Dash Punctuation | 17940 | 14.3% |
| Uppercase Letter | 17940 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 22887 | |
| 0 | 13890 | |
| 2 | 13777 | |
| 6 | 7081 | 7.9% |
| 7 | 6682 | 7.4% |
| 4 | 6635 | 7.4% |
| 5 | 6604 | 7.4% |
| 3 | 4905 | 5.5% |
| 8 | 3673 | 4.1% |
| 9 | 3566 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 7556 | |
| A | 7556 | |
| U | 1414 | 7.9% |
| S | 1414 | 7.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 107640 | |
| Latin | 17940 | 14.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 22887 | |
| - | 17940 | |
| 0 | 13890 | |
| 2 | 13777 | |
| 6 | 7081 | 6.6% |
| 7 | 6682 | 6.2% |
| 4 | 6635 | 6.2% |
| 5 | 6604 | 6.1% |
| 3 | 4905 | 4.6% |
| 8 | 3673 | 3.4% |
Latin
| Value | Count | Frequency (%) |
| C | 7556 | |
| A | 7556 | |
| U | 1414 | 7.9% |
| S | 1414 | 7.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 125580 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 22887 | |
| - | 17940 | |
| 0 | 13890 | |
| 2 | 13777 | |
| C | 7556 | 6.0% |
| A | 7556 | 6.0% |
| 6 | 7081 | 5.6% |
| 7 | 6682 | 5.3% |
| 4 | 6635 | 5.3% |
| 5 | 6604 | 5.3% |
| Other values (5) | 14972 |
| Distinct | 1228 |
|---|---|
| Distinct (%) | 13.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| 9/2/2017 | 35 |
|---|---|
| 11/10/2016 | 34 |
| 9/5/2016 | 32 |
| 12/1/2017 | 31 |
| 12/8/2017 | 29 |
| Other values (1223) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.065663322 |
| Min length | 8 |
Characters and Unicode
| Total characters | 81319 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 138 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | 11/8/2016 |
|---|---|
| 2nd row | 11/8/2016 |
| 3rd row | 6/12/2016 |
| 4th row | 10/11/2015 |
| 5th row | 10/11/2015 |
Common Values
| Value | Count | Frequency (%) |
| 9/2/2017 | 35 | 0.4% |
| 11/10/2016 | 34 | 0.4% |
| 9/5/2016 | 32 | 0.4% |
| 12/1/2017 | 31 | 0.3% |
| 12/8/2017 | 29 | 0.3% |
| 12/9/2017 | 29 | 0.3% |
| 12/11/2016 | 28 | 0.3% |
| 11/12/2017 | 28 | 0.3% |
| 11/24/2016 | 27 | 0.3% |
| 12/2/2017 | 27 | 0.3% |
| Other values (1218) | 8670 |
Length
| Value | Count | Frequency (%) |
| 9/2/2017 | 35 | 0.4% |
| 11/10/2016 | 34 | 0.4% |
| 9/5/2016 | 32 | 0.4% |
| 12/1/2017 | 31 | 0.3% |
| 12/8/2017 | 29 | 0.3% |
| 12/9/2017 | 29 | 0.3% |
| 12/11/2016 | 28 | 0.3% |
| 11/12/2017 | 28 | 0.3% |
| 12/2/2017 | 27 | 0.3% |
| 11/24/2016 | 27 | 0.3% |
| Other values (1218) | 8670 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 17993 | |
| / | 17940 | |
| 2 | 14311 | |
| 0 | 10601 | |
| 7 | 4448 | 5.5% |
| 6 | 3795 | 4.7% |
| 5 | 3398 | 4.2% |
| 4 | 3235 | 4.0% |
| 9 | 2080 | 2.6% |
| 3 | 2017 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 63379 | |
| Other Punctuation | 17940 | 22.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 17993 | |
| 2 | 14311 | |
| 0 | 10601 | |
| 7 | 4448 | 7.0% |
| 6 | 3795 | 6.0% |
| 5 | 3398 | 5.4% |
| 4 | 3235 | 5.1% |
| 9 | 2080 | 3.3% |
| 3 | 2017 | 3.2% |
| 8 | 1501 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 17940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 81319 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 17993 | |
| / | 17940 | |
| 2 | 14311 | |
| 0 | 10601 | |
| 7 | 4448 | 5.5% |
| 6 | 3795 | 4.7% |
| 5 | 3398 | 4.2% |
| 4 | 3235 | 4.0% |
| 9 | 2080 | 2.6% |
| 3 | 2017 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 81319 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 17993 | |
| / | 17940 | |
| 2 | 14311 | |
| 0 | 10601 | |
| 7 | 4448 | 5.5% |
| 6 | 3795 | 4.7% |
| 5 | 3398 | 4.2% |
| 4 | 3235 | 4.0% |
| 9 | 2080 | 2.6% |
| 3 | 2017 | 2.5% |
| Distinct | 1322 |
|---|---|
| Distinct (%) | 14.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| 12/16/2015 | 34 |
|---|---|
| 9/26/2017 | 32 |
| 9/6/2017 | 30 |
| 12/12/2017 | 29 |
| 11/21/2017 | 29 |
| Other values (1317) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.07335563 |
| Min length | 8 |
Characters and Unicode
| Total characters | 81388 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 141 ? |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | 11/11/2016 |
|---|---|
| 2nd row | 11/11/2016 |
| 3rd row | 6/16/2016 |
| 4th row | 10/18/2015 |
| 5th row | 10/18/2015 |
Common Values
| Value | Count | Frequency (%) |
| 12/16/2015 | 34 | 0.4% |
| 9/26/2017 | 32 | 0.4% |
| 9/6/2017 | 30 | 0.3% |
| 12/12/2017 | 29 | 0.3% |
| 11/21/2017 | 29 | 0.3% |
| 12/6/2017 | 27 | 0.3% |
| 9/15/2017 | 25 | 0.3% |
| 9/13/2014 | 25 | 0.3% |
| 11/16/2017 | 24 | 0.3% |
| 9/8/2017 | 24 | 0.3% |
| Other values (1312) | 8691 |
Length
| Value | Count | Frequency (%) |
| 12/16/2015 | 34 | 0.4% |
| 9/26/2017 | 32 | 0.4% |
| 9/6/2017 | 30 | 0.3% |
| 12/12/2017 | 29 | 0.3% |
| 11/21/2017 | 29 | 0.3% |
| 12/6/2017 | 27 | 0.3% |
| 9/15/2017 | 25 | 0.3% |
| 9/13/2014 | 25 | 0.3% |
| 9/26/2015 | 24 | 0.3% |
| 9/8/2017 | 24 | 0.3% |
| Other values (1312) | 8691 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 17943 | |
| / | 17940 | |
| 2 | 14399 | |
| 0 | 10508 | |
| 7 | 4489 | 5.5% |
| 6 | 3959 | 4.9% |
| 5 | 3459 | 4.3% |
| 4 | 3148 | 3.9% |
| 9 | 2039 | 2.5% |
| 3 | 1934 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 63448 | |
| Other Punctuation | 17940 | 22.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 17943 | |
| 2 | 14399 | |
| 0 | 10508 | |
| 7 | 4489 | 7.1% |
| 6 | 3959 | 6.2% |
| 5 | 3459 | 5.5% |
| 4 | 3148 | 5.0% |
| 9 | 2039 | 3.2% |
| 3 | 1934 | 3.0% |
| 8 | 1570 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 17940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 81388 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 17943 | |
| / | 17940 | |
| 2 | 14399 | |
| 0 | 10508 | |
| 7 | 4489 | 5.5% |
| 6 | 3959 | 4.9% |
| 5 | 3459 | 4.3% |
| 4 | 3148 | 3.9% |
| 9 | 2039 | 2.5% |
| 3 | 1934 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 81388 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 17943 | |
| / | 17940 | |
| 2 | 14399 | |
| 0 | 10508 | |
| 7 | 4489 | 5.5% |
| 6 | 3959 | 4.9% |
| 5 | 3459 | 4.3% |
| 4 | 3148 | 3.9% |
| 9 | 2039 | 2.5% |
| 3 | 1934 | 2.4% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| Standard Class | |
|---|---|
| Second Class | |
| First Class | |
| Same Day | 494 |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 12.81326644 |
| Min length | 8 |
Characters and Unicode
| Total characters | 114935 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Second Class |
|---|---|
| 2nd row | Second Class |
| 3rd row | Second Class |
| 4th row | Standard Class |
| 5th row | Standard Class |
Common Values
| Value | Count | Frequency (%) |
| Standard Class | 5323 | |
| Second Class | 1778 | 19.8% |
| First Class | 1375 | 15.3% |
| Same Day | 494 | 5.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| class | 8476 | |
| standard | 5323 | |
| second | 1778 | 9.9% |
| first | 1375 | 7.7% |
| same | 494 | 2.8% |
| day | 494 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 20110 | |
| s | 18327 | |
| d | 12424 | |
| 8970 | ||
| l | 8476 | |
| C | 8476 | |
| S | 7595 | 6.6% |
| n | 7101 | 6.2% |
| r | 6698 | 5.8% |
| t | 6698 | 5.8% |
| Other values (8) | 10060 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 88025 | |
| Uppercase Letter | 17940 | 15.6% |
| Space Separator | 8970 | 7.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 20110 | |
| s | 18327 | |
| d | 12424 | |
| l | 8476 | |
| n | 7101 | 8.1% |
| r | 6698 | 7.6% |
| t | 6698 | 7.6% |
| e | 2272 | 2.6% |
| c | 1778 | 2.0% |
| o | 1778 | 2.0% |
| Other values (3) | 2363 | 2.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 8476 | |
| S | 7595 | |
| F | 1375 | 7.7% |
| D | 494 | 2.8% |
Space Separator
| Value | Count | Frequency (%) |
| 8970 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 105965 | |
| Common | 8970 | 7.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 20110 | |
| s | 18327 | |
| d | 12424 | |
| l | 8476 | |
| C | 8476 | |
| S | 7595 | 7.2% |
| n | 7101 | 6.7% |
| r | 6698 | 6.3% |
| t | 6698 | 6.3% |
| e | 2272 | 2.1% |
| Other values (7) | 7788 | 7.3% |
Common
| Value | Count | Frequency (%) |
| 8970 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 114935 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 20110 | |
| s | 18327 | |
| d | 12424 | |
| 8970 | ||
| l | 8476 | |
| C | 8476 | |
| S | 7595 | 6.6% |
| n | 7101 | 6.2% |
| r | 6698 | 5.8% |
| t | 6698 | 5.8% |
| Other values (8) | 10060 |
| Distinct | 792 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| JL-15835 | 33 |
|---|---|
| PP-18955 | 32 |
| MA-17560 | 32 |
| WB-21850 | 32 |
| SV-20365 | 31 |
| Other values (787) |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 71760 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | CG-12520 |
|---|---|
| 2nd row | CG-12520 |
| 3rd row | DV-13045 |
| 4th row | SO-20335 |
| 5th row | SO-20335 |
Common Values
| Value | Count | Frequency (%) |
| JL-15835 | 33 | 0.4% |
| PP-18955 | 32 | 0.4% |
| MA-17560 | 32 | 0.4% |
| WB-21850 | 32 | 0.4% |
| SV-20365 | 31 | 0.3% |
| EH-13765 | 31 | 0.3% |
| AP-10915 | 30 | 0.3% |
| CK-12205 | 29 | 0.3% |
| JD-15895 | 29 | 0.3% |
| CS-12250 | 28 | 0.3% |
| Other values (782) | 8663 |
Length
| Value | Count | Frequency (%) |
| jl-15835 | 33 | 0.4% |
| ma-17560 | 32 | 0.4% |
| wb-21850 | 32 | 0.4% |
| pp-18955 | 32 | 0.4% |
| sv-20365 | 31 | 0.3% |
| eh-13765 | 31 | 0.3% |
| ap-10915 | 30 | 0.3% |
| ck-12205 | 29 | 0.3% |
| jd-15895 | 29 | 0.3% |
| cl-12565 | 28 | 0.3% |
| Other values (782) | 8663 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 10721 | |
| - | 8970 | |
| 0 | 7600 | 10.6% |
| 5 | 7100 | 9.9% |
| 2 | 4186 | 5.8% |
| 6 | 2613 | 3.6% |
| 7 | 2609 | 3.6% |
| 9 | 2588 | 3.6% |
| 8 | 2548 | 3.6% |
| 3 | 2524 | 3.5% |
| Other values (30) | 20301 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 44850 | |
| Uppercase Letter | 17904 | 24.9% |
| Dash Punctuation | 8970 | 12.5% |
| Lowercase Letter | 36 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1572 | 8.8% |
| M | 1534 | 8.6% |
| C | 1534 | 8.6% |
| B | 1482 | 8.3% |
| D | 1191 | 6.7% |
| A | 1111 | 6.2% |
| J | 1019 | 5.7% |
| P | 1000 | 5.6% |
| H | 870 | 4.9% |
| K | 836 | 4.7% |
| Other values (16) | 5755 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 10721 | |
| 0 | 7600 | |
| 5 | 7100 | |
| 2 | 4186 | 9.3% |
| 6 | 2613 | 5.8% |
| 7 | 2609 | 5.8% |
| 9 | 2588 | 5.8% |
| 8 | 2548 | 5.7% |
| 3 | 2524 | 5.6% |
| 4 | 2361 | 5.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 24 | |
| o | 7 | 19.4% |
| l | 5 | 13.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8970 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 53820 | |
| Latin | 17940 | 25.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1572 | 8.8% |
| M | 1534 | 8.6% |
| C | 1534 | 8.6% |
| B | 1482 | 8.3% |
| D | 1191 | 6.6% |
| A | 1111 | 6.2% |
| J | 1019 | 5.7% |
| P | 1000 | 5.6% |
| H | 870 | 4.8% |
| K | 836 | 4.7% |
| Other values (19) | 5791 |
Common
| Value | Count | Frequency (%) |
| 1 | 10721 | |
| - | 8970 | |
| 0 | 7600 | |
| 5 | 7100 | |
| 2 | 4186 | 7.8% |
| 6 | 2613 | 4.9% |
| 7 | 2609 | 4.8% |
| 9 | 2588 | 4.8% |
| 8 | 2548 | 4.7% |
| 3 | 2524 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 71760 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 10721 | |
| - | 8970 | |
| 0 | 7600 | 10.6% |
| 5 | 7100 | 9.9% |
| 2 | 4186 | 5.8% |
| 6 | 2613 | 3.6% |
| 7 | 2609 | 3.6% |
| 9 | 2588 | 3.6% |
| 8 | 2548 | 3.6% |
| 3 | 2524 | 3.5% |
| Other values (30) | 20301 |
| Distinct | 792 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| John Lee | 33 |
|---|---|
| Paul Prost | 32 |
| Matt Abelman | 32 |
| William Brown | 32 |
| Seth Vernon | 31 |
| Other values (787) |
Length
| Max length | 22 |
|---|---|
| Median length | 18 |
| Mean length | 12.9509476 |
| Min length | 7 |
Characters and Unicode
| Total characters | 116170 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Claire Gute |
|---|---|
| 2nd row | Claire Gute |
| 3rd row | Darrin Van Huff |
| 4th row | Sean O'Donnell |
| 5th row | Sean O'Donnell |
Common Values
| Value | Count | Frequency (%) |
| John Lee | 33 | 0.4% |
| Paul Prost | 32 | 0.4% |
| Matt Abelman | 32 | 0.4% |
| William Brown | 32 | 0.4% |
| Seth Vernon | 31 | 0.3% |
| Edward Hooks | 31 | 0.3% |
| Arthur Prichep | 30 | 0.3% |
| Chloris Kastensmidt | 29 | 0.3% |
| Jonathan Doherty | 29 | 0.3% |
| Chris Selesnick | 28 | 0.3% |
| Other values (782) | 8663 |
Length
| Value | Count | Frequency (%) |
| john | 101 | 0.6% |
| michael | 98 | 0.5% |
| frank | 98 | 0.5% |
| patrick | 89 | 0.5% |
| brian | 88 | 0.5% |
| rick | 87 | 0.5% |
| paul | 86 | 0.5% |
| ken | 81 | 0.5% |
| stewart | 80 | 0.4% |
| brown | 76 | 0.4% |
| Other values (899) | 17114 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10723 | 9.2% |
| e | 10670 | 9.2% |
| n | 9248 | 8.0% |
| 9028 | 7.8% | |
| r | 8512 | 7.3% |
| i | 7078 | 6.1% |
| l | 5824 | 5.0% |
| o | 5223 | 4.5% |
| t | 4858 | 4.2% |
| s | 4073 | 3.5% |
| Other values (47) | 40933 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 88617 | |
| Uppercase Letter | 18383 | 15.8% |
| Space Separator | 9028 | 7.8% |
| Other Punctuation | 115 | 0.1% |
| Dash Punctuation | 27 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10723 | |
| e | 10670 | |
| n | 9248 | |
| r | 8512 | |
| i | 7078 | 8.0% |
| l | 5824 | 6.6% |
| o | 5223 | 5.9% |
| t | 4858 | 5.5% |
| s | 4073 | 4.6% |
| h | 3454 | 3.9% |
| Other values (18) | 18954 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1632 | 8.9% |
| S | 1572 | 8.6% |
| M | 1570 | 8.5% |
| B | 1530 | 8.3% |
| D | 1220 | 6.6% |
| A | 1162 | 6.3% |
| J | 1019 | 5.5% |
| P | 1000 | 5.4% |
| H | 903 | 4.9% |
| K | 867 | 4.7% |
| Other values (16) | 5908 |
Space Separator
| Value | Count | Frequency (%) |
| 9028 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 115 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 27 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 107000 | |
| Common | 9170 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10723 | 10.0% |
| e | 10670 | 10.0% |
| n | 9248 | 8.6% |
| r | 8512 | 8.0% |
| i | 7078 | 6.6% |
| l | 5824 | 5.4% |
| o | 5223 | 4.9% |
| t | 4858 | 4.5% |
| s | 4073 | 3.8% |
| h | 3454 | 3.2% |
| Other values (44) | 37337 |
Common
| Value | Count | Frequency (%) |
| 9028 | ||
| ' | 115 | 1.3% |
| - | 27 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 116094 | |
| None | 76 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10723 | 9.2% |
| e | 10670 | 9.2% |
| n | 9248 | 8.0% |
| 9028 | 7.8% | |
| r | 8512 | 7.3% |
| i | 7078 | 6.1% |
| l | 5824 | 5.0% |
| o | 5223 | 4.5% |
| t | 4858 | 4.2% |
| s | 4073 | 3.5% |
| Other values (44) | 40857 |
None
| Value | Count | Frequency (%) |
| ö | 53 | |
| ä | 18 | 23.7% |
| ü | 5 | 6.6% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| Consumer | |
|---|---|
| Corporate | |
| Home Office |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 8.838350056 |
| Min length | 8 |
Characters and Unicode
| Total characters | 79280 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Consumer |
|---|---|
| 2nd row | Consumer |
| 3rd row | Corporate |
| 4th row | Consumer |
| 5th row | Consumer |
Common Values
| Value | Count | Frequency (%) |
| Consumer | 4658 | |
| Corporate | 2708 | |
| Home Office | 1604 | 17.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| consumer | 4658 | |
| corporate | 2708 | |
| home | 1604 | 15.2% |
| office | 1604 | 15.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 11678 | |
| e | 10574 | |
| r | 10074 | |
| C | 7366 | |
| m | 6262 | |
| n | 4658 | 5.9% |
| s | 4658 | 5.9% |
| u | 4658 | 5.9% |
| f | 3208 | 4.0% |
| t | 2708 | 3.4% |
| Other values (7) | 13436 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 67102 | |
| Uppercase Letter | 10574 | 13.3% |
| Space Separator | 1604 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 11678 | |
| e | 10574 | |
| r | 10074 | |
| m | 6262 | |
| n | 4658 | 6.9% |
| s | 4658 | 6.9% |
| u | 4658 | 6.9% |
| f | 3208 | 4.8% |
| t | 2708 | 4.0% |
| p | 2708 | 4.0% |
| Other values (3) | 5916 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 7366 | |
| H | 1604 | 15.2% |
| O | 1604 | 15.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1604 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 77676 | |
| Common | 1604 | 2.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 11678 | |
| e | 10574 | |
| r | 10074 | |
| C | 7366 | |
| m | 6262 | |
| n | 4658 | 6.0% |
| s | 4658 | 6.0% |
| u | 4658 | 6.0% |
| f | 3208 | 4.1% |
| t | 2708 | 3.5% |
| Other values (6) | 11832 |
Common
| Value | Count | Frequency (%) |
| 1604 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79280 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 11678 | |
| e | 10574 | |
| r | 10074 | |
| C | 7366 | |
| m | 6262 | |
| n | 4658 | 5.9% |
| s | 4658 | 5.9% |
| u | 4658 | 5.9% |
| f | 3208 | 4.0% |
| t | 2708 | 3.4% |
| Other values (7) | 13436 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| United States |
|---|
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 116610 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | United States |
| 3rd row | United States |
| 4th row | United States |
| 5th row | United States |
Common Values
| Value | Count | Frequency (%) |
| United States | 8970 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| united | 8970 | |
| states | 8970 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 26910 | |
| e | 17940 | |
| U | 8970 | 7.7% |
| n | 8970 | 7.7% |
| i | 8970 | 7.7% |
| d | 8970 | 7.7% |
| 8970 | 7.7% | |
| S | 8970 | 7.7% |
| a | 8970 | 7.7% |
| s | 8970 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 89700 | |
| Uppercase Letter | 17940 | 15.4% |
| Space Separator | 8970 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 26910 | |
| e | 17940 | |
| n | 8970 | 10.0% |
| i | 8970 | 10.0% |
| d | 8970 | 10.0% |
| a | 8970 | 10.0% |
| s | 8970 | 10.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 8970 | |
| S | 8970 |
Space Separator
| Value | Count | Frequency (%) |
| 8970 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 107640 | |
| Common | 8970 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 26910 | |
| e | 17940 | |
| U | 8970 | 8.3% |
| n | 8970 | 8.3% |
| i | 8970 | 8.3% |
| d | 8970 | 8.3% |
| S | 8970 | 8.3% |
| a | 8970 | 8.3% |
| s | 8970 | 8.3% |
Common
| Value | Count | Frequency (%) |
| 8970 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 116610 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 26910 | |
| e | 17940 | |
| U | 8970 | 7.7% |
| n | 8970 | 7.7% |
| i | 8970 | 7.7% |
| d | 8970 | 7.7% |
| 8970 | 7.7% | |
| S | 8970 | 7.7% |
| a | 8970 | 7.7% |
| s | 8970 | 7.7% |
| Distinct | 519 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| New York City | |
|---|---|
| Los Angeles | |
| San Francisco | 498 |
| Philadelphia | 433 |
| Seattle | 419 |
| Other values (514) |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 9.423411371 |
| Min length | 4 |
Characters and Unicode
| Total characters | 84528 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 70 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Henderson |
|---|---|
| 2nd row | Henderson |
| 3rd row | Los Angeles |
| 4th row | Fort Lauderdale |
| 5th row | Fort Lauderdale |
Common Values
| Value | Count | Frequency (%) |
| New York City | 896 | 10.0% |
| Los Angeles | 736 | 8.2% |
| San Francisco | 498 | 5.6% |
| Philadelphia | 433 | 4.8% |
| Seattle | 419 | 4.7% |
| Houston | 259 | 2.9% |
| Chicago | 212 | 2.4% |
| Columbus | 199 | 2.2% |
| San Diego | 164 | 1.8% |
| Springfield | 147 | 1.6% |
| Other values (509) | 5007 |
Length
| Value | Count | Frequency (%) |
| city | 969 | 7.4% |
| new | 917 | 7.0% |
| york | 899 | 6.9% |
| san | 767 | 5.9% |
| los | 736 | 5.6% |
| angeles | 736 | 5.6% |
| francisco | 498 | 3.8% |
| philadelphia | 433 | 3.3% |
| seattle | 419 | 3.2% |
| houston | 259 | 2.0% |
| Other values (544) | 6411 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8139 | 9.6% |
| a | 6736 | 8.0% |
| o | 6648 | 7.9% |
| n | 5660 | 6.7% |
| i | 5534 | 6.5% |
| l | 5294 | 6.3% |
| s | 4290 | 5.1% |
| r | 4090 | 4.8% |
| 4074 | 4.8% | |
| t | 4064 | 4.8% |
| Other values (41) | 29999 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 67410 | |
| Uppercase Letter | 13044 | 15.4% |
| Space Separator | 4074 | 4.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8139 | |
| a | 6736 | |
| o | 6648 | |
| n | 5660 | 8.4% |
| i | 5534 | 8.2% |
| l | 5294 | 7.9% |
| s | 4290 | 6.4% |
| r | 4090 | 6.1% |
| t | 4064 | 6.0% |
| c | 2166 | 3.2% |
| Other values (16) | 14789 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1871 | |
| S | 1643 | |
| L | 1231 | |
| A | 1157 | |
| N | 1104 | |
| Y | 917 | 7.0% |
| P | 831 | 6.4% |
| F | 744 | 5.7% |
| D | 554 | 4.2% |
| H | 477 | 3.7% |
| Other values (14) | 2515 |
Space Separator
| Value | Count | Frequency (%) |
| 4074 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 80454 | |
| Common | 4074 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8139 | 10.1% |
| a | 6736 | 8.4% |
| o | 6648 | 8.3% |
| n | 5660 | 7.0% |
| i | 5534 | 6.9% |
| l | 5294 | 6.6% |
| s | 4290 | 5.3% |
| r | 4090 | 5.1% |
| t | 4064 | 5.1% |
| c | 2166 | 2.7% |
| Other values (40) | 27833 |
Common
| Value | Count | Frequency (%) |
| 4074 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 84528 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8139 | 9.6% |
| a | 6736 | 8.0% |
| o | 6648 | 7.9% |
| n | 5660 | 6.7% |
| i | 5534 | 6.5% |
| l | 5294 | 6.3% |
| s | 4290 | 5.1% |
| r | 4090 | 4.8% |
| 4074 | 4.8% | |
| t | 4064 | 4.8% |
| Other values (41) | 29999 |
| Distinct | 49 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| California | |
|---|---|
| New York | |
| Texas | |
| Washington | |
| Pennsylvania | |
| Other values (44) |
Length
| Max length | 20 |
|---|---|
| Median length | 14 |
| Mean length | 8.599442586 |
| Min length | 4 |
Characters and Unicode
| Total characters | 77137 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Kentucky |
|---|---|
| 2nd row | Kentucky |
| 3rd row | California |
| 4th row | Florida |
| 5th row | Florida |
Common Values
| Value | Count | Frequency (%) |
| California | 1963 | |
| New York | 1103 | |
| Texas | 695 | 7.7% |
| Washington | 495 | 5.5% |
| Pennsylvania | 471 | 5.3% |
| Ohio | 383 | 4.3% |
| Illinois | 331 | 3.7% |
| Florida | 312 | 3.5% |
| Michigan | 252 | 2.8% |
| Virginia | 217 | 2.4% |
| Other values (39) | 2748 |
Length
| Value | Count | Frequency (%) |
| california | 1963 | |
| new | 1296 | 12.2% |
| york | 1103 | 10.4% |
| texas | 695 | 6.5% |
| washington | 495 | 4.7% |
| pennsylvania | 471 | 4.4% |
| ohio | 383 | 3.6% |
| illinois | 331 | 3.1% |
| florida | 312 | 2.9% |
| michigan | 252 | 2.4% |
| Other values (43) | 3311 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 9858 | |
| i | 9077 | |
| n | 7314 | 9.5% |
| o | 6638 | 8.6% |
| r | 5200 | 6.7% |
| e | 4456 | 5.8% |
| l | 4186 | 5.4% |
| s | 3939 | 5.1% |
| C | 2441 | 3.2% |
| f | 1973 | 2.6% |
| Other values (36) | 22055 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 64893 | |
| Uppercase Letter | 10602 | 13.7% |
| Space Separator | 1642 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9858 | |
| i | 9077 | |
| n | 7314 | |
| o | 6638 | |
| r | 5200 | |
| e | 4456 | |
| l | 4186 | 6.5% |
| s | 3939 | 6.1% |
| f | 1973 | 3.0% |
| h | 1748 | 2.7% |
| Other values (14) | 10504 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2441 | |
| N | 1582 | |
| Y | 1103 | |
| T | 848 | 8.0% |
| M | 750 | 7.1% |
| W | 608 | 5.7% |
| I | 583 | 5.5% |
| O | 549 | 5.2% |
| P | 471 | 4.4% |
| F | 312 | 2.9% |
| Other values (11) | 1355 |
Space Separator
| Value | Count | Frequency (%) |
| 1642 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 75495 | |
| Common | 1642 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9858 | |
| i | 9077 | |
| n | 7314 | 9.7% |
| o | 6638 | 8.8% |
| r | 5200 | 6.9% |
| e | 4456 | 5.9% |
| l | 4186 | 5.5% |
| s | 3939 | 5.2% |
| C | 2441 | 3.2% |
| f | 1973 | 2.6% |
| Other values (35) | 20413 |
Common
| Value | Count | Frequency (%) |
| 1642 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 77137 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 9858 | |
| i | 9077 | |
| n | 7314 | 9.5% |
| o | 6638 | 8.6% |
| r | 5200 | 6.7% |
| e | 4456 | 5.8% |
| l | 4186 | 5.4% |
| s | 3939 | 5.1% |
| C | 2441 | 3.2% |
| f | 1973 | 2.6% |
| Other values (36) | 22055 |
| Distinct | 617 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54959.33489 |
| Minimum | 1040 |
|---|---|
| Maximum | 99301 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 70.2 KiB |
Quantile statistics
| Minimum | 1040 |
|---|---|
| 5-th percentile | 10009 |
| Q1 | 22153 |
| median | 54601 |
| Q3 | 90032 |
| 95-th percentile | 98103 |
| Maximum | 99301 |
| Range | 98261 |
| Interquartile range (IQR) | 67879 |
Descriptive statistics
| Standard deviation | 32760.86894 |
|---|---|
| Coefficient of variation (CV) | 0.5960928931 |
| Kurtosis | -1.531868321 |
| Mean | 54959.33489 |
| Median Absolute Deviation (MAD) | 35403 |
| Skewness | -0.1081174326 |
| Sum | 492985234 |
| Variance | 1073274534 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10035 | 255 | 2.8% |
| 10009 | 227 | 2.5% |
| 10024 | 227 | 2.5% |
| 94122 | 199 | 2.2% |
| 10011 | 187 | 2.1% |
| 98105 | 163 | 1.8% |
| 94110 | 161 | 1.8% |
| 90049 | 148 | 1.6% |
| 98103 | 148 | 1.6% |
| 94109 | 138 | 1.5% |
| Other values (607) | 7117 |
| Value | Count | Frequency (%) |
| 1040 | 1 | < 0.1% |
| 1453 | 6 | 0.1% |
| 1752 | 2 | < 0.1% |
| 1810 | 4 | < 0.1% |
| 1841 | 32 | |
| 1852 | 16 | |
| 1915 | 3 | < 0.1% |
| 2038 | 17 | |
| 2138 | 6 | 0.1% |
| 2148 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 99301 | 6 | 0.1% |
| 99207 | 7 | 0.1% |
| 98661 | 5 | 0.1% |
| 98632 | 3 | < 0.1% |
| 98502 | 5 | 0.1% |
| 98270 | 1 | < 0.1% |
| 98226 | 2 | < 0.1% |
| 98208 | 1 | < 0.1% |
| 98198 | 7 | 0.1% |
| 98115 | 108 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| West | |
|---|---|
| East | |
| Central | |
| South |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.783723523 |
| Min length | 4 |
Characters and Unicode
| Total characters | 42910 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | South |
|---|---|
| 2nd row | South |
| 3rd row | West |
| 4th row | South |
| 5th row | South |
Common Values
| Value | Count | Frequency (%) |
| West | 3043 | |
| East | 2611 | |
| Central | 1857 | |
| South | 1459 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| west | 3043 | |
| east | 2611 | |
| central | 1857 | |
| south | 1459 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 8970 | |
| s | 5654 | |
| e | 4900 | |
| a | 4468 | |
| W | 3043 | 7.1% |
| E | 2611 | 6.1% |
| C | 1857 | 4.3% |
| n | 1857 | 4.3% |
| r | 1857 | 4.3% |
| l | 1857 | 4.3% |
| Other values (4) | 5836 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33940 | |
| Uppercase Letter | 8970 | 20.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 8970 | |
| s | 5654 | |
| e | 4900 | |
| a | 4468 | |
| n | 1857 | 5.5% |
| r | 1857 | 5.5% |
| l | 1857 | 5.5% |
| o | 1459 | 4.3% |
| u | 1459 | 4.3% |
| h | 1459 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 3043 | |
| E | 2611 | |
| C | 1857 | |
| S | 1459 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42910 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 8970 | |
| s | 5654 | |
| e | 4900 | |
| a | 4468 | |
| W | 3043 | 7.1% |
| E | 2611 | 6.1% |
| C | 1857 | 4.3% |
| n | 1857 | 4.3% |
| r | 1857 | 4.3% |
| l | 1857 | 4.3% |
| Other values (4) | 5836 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42910 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 8970 | |
| s | 5654 | |
| e | 4900 | |
| a | 4468 | |
| W | 3043 | 7.1% |
| E | 2611 | 6.1% |
| C | 1857 | 4.3% |
| n | 1857 | 4.3% |
| r | 1857 | 4.3% |
| l | 1857 | 4.3% |
| Other values (4) | 5836 |
| Distinct | 1847 |
|---|---|
| Distinct (%) | 20.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| OFF-PA-10001970 | 18 |
|---|---|
| TEC-AC-10003832 | 17 |
| TEC-AC-10002049 | 15 |
| FUR-CH-10001146 | 15 |
| TEC-AC-10003628 | 14 |
| Other values (1842) |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 15 |
Characters and Unicode
| Total characters | 134550 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 119 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | FUR-BO-10001798 |
|---|---|
| 2nd row | FUR-CH-10000454 |
| 3rd row | OFF-LA-10000240 |
| 4th row | FUR-TA-10000577 |
| 5th row | OFF-ST-10000760 |
Common Values
| Value | Count | Frequency (%) |
| OFF-PA-10001970 | 18 | 0.2% |
| TEC-AC-10003832 | 17 | 0.2% |
| TEC-AC-10002049 | 15 | 0.2% |
| FUR-CH-10001146 | 15 | 0.2% |
| TEC-AC-10003628 | 14 | 0.2% |
| FUR-CH-10003774 | 14 | 0.2% |
| FUR-CH-10002647 | 14 | 0.2% |
| OFF-PA-10002377 | 14 | 0.2% |
| FUR-CH-10002880 | 14 | 0.2% |
| FUR-CH-10004287 | 13 | 0.1% |
| Other values (1837) | 8822 |
Length
| Value | Count | Frequency (%) |
| off-pa-10001970 | 18 | 0.2% |
| tec-ac-10003832 | 17 | 0.2% |
| tec-ac-10002049 | 15 | 0.2% |
| fur-ch-10001146 | 15 | 0.2% |
| tec-ac-10003628 | 14 | 0.2% |
| fur-ch-10003774 | 14 | 0.2% |
| fur-ch-10002647 | 14 | 0.2% |
| off-pa-10002377 | 14 | 0.2% |
| fur-ch-10002880 | 14 | 0.2% |
| tec-ac-10003038 | 13 | 0.1% |
| Other values (1837) | 8822 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 31448 | |
| - | 17940 | |
| F | 13456 | |
| 1 | 13451 | |
| O | 5532 | 4.1% |
| 3 | 4355 | 3.2% |
| 2 | 4342 | 3.2% |
| 4 | 4330 | 3.2% |
| A | 4248 | 3.2% |
| C | 3213 | 2.4% |
| Other values (17) | 32235 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 71760 | |
| Uppercase Letter | 44850 | |
| Dash Punctuation | 17940 | 13.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 13456 | |
| O | 5532 | |
| A | 4248 | 9.5% |
| C | 3213 | 7.2% |
| T | 2928 | 6.5% |
| U | 2920 | 6.5% |
| R | 2715 | 6.1% |
| P | 2616 | 5.8% |
| E | 2040 | 4.5% |
| H | 1480 | 3.3% |
| Other values (6) | 3702 | 8.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 31448 | |
| 1 | 13451 | |
| 3 | 4355 | 6.1% |
| 2 | 4342 | 6.1% |
| 4 | 4330 | 6.0% |
| 5 | 3098 | 4.3% |
| 7 | 2760 | 3.8% |
| 9 | 2720 | 3.8% |
| 6 | 2662 | 3.7% |
| 8 | 2594 | 3.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 89700 | |
| Latin | 44850 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 13456 | |
| O | 5532 | |
| A | 4248 | 9.5% |
| C | 3213 | 7.2% |
| T | 2928 | 6.5% |
| U | 2920 | 6.5% |
| R | 2715 | 6.1% |
| P | 2616 | 5.8% |
| E | 2040 | 4.5% |
| H | 1480 | 3.3% |
| Other values (6) | 3702 | 8.3% |
Common
| Value | Count | Frequency (%) |
| 0 | 31448 | |
| - | 17940 | |
| 1 | 13451 | |
| 3 | 4355 | 4.9% |
| 2 | 4342 | 4.8% |
| 4 | 4330 | 4.8% |
| 5 | 3098 | 3.5% |
| 7 | 2760 | 3.1% |
| 9 | 2720 | 3.0% |
| 6 | 2662 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 134550 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 31448 | |
| - | 17940 | |
| F | 13456 | |
| 1 | 13451 | |
| O | 5532 | 4.1% |
| 3 | 4355 | 3.2% |
| 2 | 4342 | 3.2% |
| 4 | 4330 | 3.2% |
| A | 4248 | 3.2% |
| C | 3213 | 2.4% |
| Other values (17) | 32235 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| Office Supplies | |
|---|---|
| Furniture | |
| Technology |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 12.7154961 |
| Min length | 9 |
Characters and Unicode
| Total characters | 114058 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Furniture |
|---|---|
| 2nd row | Furniture |
| 3rd row | Office Supplies |
| 4th row | Furniture |
| 5th row | Office Supplies |
Common Values
| Value | Count | Frequency (%) |
| Office Supplies | 5257 | |
| Furniture | 1927 | 21.5% |
| Technology | 1786 | 19.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| office | 5257 | |
| supplies | 5257 | |
| furniture | 1927 | 13.5% |
| technology | 1786 | 12.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14227 | |
| i | 12441 | |
| p | 10514 | 9.2% |
| f | 10514 | 9.2% |
| u | 9111 | 8.0% |
| c | 7043 | 6.2% |
| l | 7043 | 6.2% |
| O | 5257 | 4.6% |
| s | 5257 | 4.6% |
| S | 5257 | 4.6% |
| Other values (10) | 27394 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 94574 | |
| Uppercase Letter | 14227 | 12.5% |
| Space Separator | 5257 | 4.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14227 | |
| i | 12441 | |
| p | 10514 | |
| f | 10514 | |
| u | 9111 | |
| c | 7043 | |
| l | 7043 | |
| s | 5257 | 5.6% |
| r | 3854 | 4.1% |
| n | 3713 | 3.9% |
| Other values (5) | 10857 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 5257 | |
| S | 5257 | |
| F | 1927 | 13.5% |
| T | 1786 | 12.6% |
Space Separator
| Value | Count | Frequency (%) |
| 5257 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 108801 | |
| Common | 5257 | 4.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 14227 | |
| i | 12441 | |
| p | 10514 | |
| f | 10514 | |
| u | 9111 | |
| c | 7043 | 6.5% |
| l | 7043 | 6.5% |
| O | 5257 | 4.8% |
| s | 5257 | 4.8% |
| S | 5257 | 4.8% |
| Other values (9) | 22137 |
Common
| Value | Count | Frequency (%) |
| 5257 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 114058 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 14227 | |
| i | 12441 | |
| p | 10514 | 9.2% |
| f | 10514 | 9.2% |
| u | 9111 | 8.0% |
| c | 7043 | 6.2% |
| l | 7043 | 6.2% |
| O | 5257 | 4.6% |
| s | 5257 | 4.6% |
| S | 5257 | 4.6% |
| Other values (10) | 27394 |
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| Paper | |
|---|---|
| Binders | |
| Phones | |
| Storage | |
| Furnishings | |
| Other values (12) |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 7.115942029 |
| Min length | 3 |
Characters and Unicode
| Total characters | 63830 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Bookcases |
|---|---|
| 2nd row | Chairs |
| 3rd row | Labels |
| 4th row | Tables |
| 5th row | Storage |
Common Values
| Value | Count | Frequency (%) |
| Paper | 1348 | |
| Binders | 889 | |
| Phones | 875 | |
| Storage | 831 | |
| Furnishings | 804 | |
| Art | 788 | |
| Accessories | 754 | |
| Chairs | 605 | |
| Appliances | 393 | 4.4% |
| Labels | 354 | 3.9% |
| Other values (7) | 1329 |
Length
| Value | Count | Frequency (%) |
| paper | 1348 | |
| binders | 889 | |
| phones | 875 | |
| storage | 831 | |
| furnishings | 804 | |
| art | 788 | |
| accessories | 754 | |
| chairs | 605 | |
| appliances | 393 | 4.4% |
| labels | 354 | 3.9% |
| Other values (7) | 1329 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 8733 | |
| e | 7992 | |
| r | 6298 | 9.9% |
| i | 4595 | 7.2% |
| a | 4349 | 6.8% |
| n | 4319 | 6.8% |
| o | 3196 | 5.0% |
| p | 2834 | 4.4% |
| h | 2373 | 3.7% |
| P | 2223 | 3.5% |
| Other values (18) | 16918 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54860 | |
| Uppercase Letter | 8970 | 14.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 8733 | |
| e | 7992 | |
| r | 6298 | |
| i | 4595 | |
| a | 4349 | |
| n | 4319 | |
| o | 3196 | 5.8% |
| p | 2834 | 5.2% |
| h | 2373 | 4.3% |
| c | 2197 | 4.0% |
| Other values (8) | 7974 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2223 | |
| A | 1935 | |
| B | 1096 | |
| S | 1020 | |
| F | 1015 | |
| C | 673 | 7.5% |
| L | 354 | 3.9% |
| T | 311 | 3.5% |
| E | 254 | 2.8% |
| M | 89 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 63830 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 8733 | |
| e | 7992 | |
| r | 6298 | 9.9% |
| i | 4595 | 7.2% |
| a | 4349 | 6.8% |
| n | 4319 | 6.8% |
| o | 3196 | 5.0% |
| p | 2834 | 4.4% |
| h | 2373 | 3.7% |
| P | 2223 | 3.5% |
| Other values (18) | 16918 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63830 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 8733 | |
| e | 7992 | |
| r | 6298 | 9.9% |
| i | 4595 | 7.2% |
| a | 4349 | 6.8% |
| n | 4319 | 6.8% |
| o | 3196 | 5.0% |
| p | 2834 | 4.4% |
| h | 2373 | 3.7% |
| P | 2223 | 3.5% |
| Other values (18) | 16918 |
| Distinct | 1835 |
|---|---|
| Distinct (%) | 20.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.2 KiB |
| Staple envelope | 48 |
|---|---|
| Easy-staple paper | 46 |
| Staples | 43 |
| Staple remover | 18 |
| KI Adjustable-Height Table | 18 |
| Other values (1830) |
Length
| Max length | 127 |
|---|---|
| Median length | 78 |
| Mean length | 36.46465998 |
| Min length | 5 |
Characters and Unicode
| Total characters | 327088 |
|---|---|
| Distinct characters | 85 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 121 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | Bush Somerset Collection Bookcase |
|---|---|
| 2nd row | Hon Deluxe Fabric Upholstered Stacking Chairs, Rounded Back |
| 3rd row | Self-Adhesive Address Labels for Typewriters by Universal |
| 4th row | Bretford CR4500 Series Slim Rectangular Table |
| 5th row | Eldon Fold 'N Roll Cart System |
Common Values
| Value | Count | Frequency (%) |
| Staple envelope | 48 | 0.5% |
| Easy-staple paper | 46 | 0.5% |
| Staples | 43 | 0.5% |
| Staple remover | 18 | 0.2% |
| KI Adjustable-Height Table | 18 | 0.2% |
| Staples in misc. colors | 17 | 0.2% |
| Logitech 910-002974 M325 Wireless Mouse for Web Scrolling | 14 | 0.2% |
| Situations Contoured Folding Chairs, 4/Set | 14 | 0.2% |
| Global Wood Trimmed Manager's Task Chair, Khaki | 14 | 0.2% |
| Global High-Back Leather Tilter, Burgundy | 14 | 0.2% |
| Other values (1825) | 8724 |
Length
| Value | Count | Frequency (%) |
| xerox | 851 | 1.7% |
| x | 641 | 1.3% |
| 558 | 1.1% | |
| with | 511 | 1.0% |
| chair | 461 | 0.9% |
| avery | 446 | 0.9% |
| for | 445 | 0.9% |
| black | 375 | 0.8% |
| phone | 367 | 0.7% |
| file | 322 | 0.6% |
| Other values (2781) | 44818 |
Most occurring characters
| Value | Count | Frequency (%) |
| 40455 | 12.4% | |
| e | 29872 | 9.1% |
| r | 18343 | 5.6% |
| o | 17781 | 5.4% |
| a | 17158 | 5.2% |
| i | 16150 | 4.9% |
| l | 14624 | 4.5% |
| n | 13362 | 4.1% |
| s | 12830 | 3.9% |
| t | 12818 | 3.9% |
| Other values (75) | 133695 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 210450 | |
| Uppercase Letter | 49717 | 15.2% |
| Space Separator | 40874 | 12.5% |
| Decimal Number | 16864 | 5.2% |
| Other Punctuation | 6340 | 1.9% |
| Dash Punctuation | 2611 | 0.8% |
| Final Punctuation | 66 | < 0.1% |
| Close Punctuation | 56 | < 0.1% |
| Open Punctuation | 56 | < 0.1% |
| Math Symbol | 31 | < 0.1% |
| Other values (2) | 23 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 29872 | |
| r | 18343 | 8.7% |
| o | 17781 | 8.4% |
| a | 17158 | 8.2% |
| i | 16150 | 7.7% |
| l | 14624 | 6.9% |
| n | 13362 | 6.3% |
| s | 12830 | 6.1% |
| t | 12818 | 6.1% |
| c | 7797 | 3.7% |
| Other values (18) | 49715 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 5667 | 11.4% |
| C | 5276 | 10.6% |
| B | 4611 | 9.3% |
| P | 4308 | 8.7% |
| M | 2633 | 5.3% |
| A | 2588 | 5.2% |
| D | 2554 | 5.1% |
| T | 2405 | 4.8% |
| F | 2284 | 4.6% |
| L | 2011 | 4.0% |
| Other values (16) | 15380 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2739 | |
| / | 1402 | |
| " | 1128 | |
| . | 429 | 6.8% |
| & | 259 | 4.1% |
| ' | 229 | 3.6% |
| # | 89 | 1.4% |
| % | 42 | 0.7% |
| * | 8 | 0.1% |
| ! | 5 | 0.1% |
| Other values (2) | 10 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3486 | |
| 0 | 2742 | |
| 2 | 2135 | |
| 4 | 1617 | |
| 3 | 1431 | |
| 5 | 1381 | 8.2% |
| 9 | 1183 | 7.0% |
| 8 | 1169 | 6.9% |
| 6 | 887 | 5.3% |
| 7 | 833 | 4.9% |
Space Separator
| Value | Count | Frequency (%) |
| 40455 | ||
| 419 | 1.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2611 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 66 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 56 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 56 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 31 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 18 |
Other Number
| Value | Count | Frequency (%) |
| ¾ | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 260167 | |
| Common | 66921 | 20.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 29872 | 11.5% |
| r | 18343 | 7.1% |
| o | 17781 | 6.8% |
| a | 17158 | 6.6% |
| i | 16150 | 6.2% |
| l | 14624 | 5.6% |
| n | 13362 | 5.1% |
| s | 12830 | 4.9% |
| t | 12818 | 4.9% |
| c | 7797 | 3.0% |
| Other values (44) | 99432 |
Common
| Value | Count | Frequency (%) |
| 40455 | ||
| 1 | 3486 | 5.2% |
| 0 | 2742 | 4.1% |
| , | 2739 | 4.1% |
| - | 2611 | 3.9% |
| 2 | 2135 | 3.2% |
| 4 | 1617 | 2.4% |
| 3 | 1431 | 2.1% |
| / | 1402 | 2.1% |
| 5 | 1381 | 2.1% |
| Other values (21) | 6922 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 326566 | |
| None | 438 | 0.1% |
| Punctuation | 84 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 40455 | 12.4% | |
| e | 29872 | 9.1% |
| r | 18343 | 5.6% |
| o | 17781 | 5.4% |
| a | 17158 | 5.3% |
| i | 16150 | 4.9% |
| l | 14624 | 4.5% |
| n | 13362 | 4.1% |
| s | 12830 | 3.9% |
| t | 12818 | 3.9% |
| Other values (69) | 133173 |
None
| Value | Count | Frequency (%) |
| 419 | ||
| é | 12 | 2.7% |
| ¾ | 5 | 1.1% |
| à | 2 | 0.5% |
Punctuation
| Value | Count | Frequency (%) |
| ” | 66 | |
| “ | 18 | 21.4% |
| Distinct | 5150 |
|---|---|
| Distinct (%) | 57.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 237.502482 |
| Minimum | 0.9900000095 |
|---|---|
| Maximum | 22638.48047 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.2 KiB |
Quantile statistics
| Minimum | 0.9900000095 |
|---|---|
| 5-th percentile | 6.137999868 |
| Q1 | 19.44000053 |
| median | 60.84000015 |
| Q3 | 225.2960052 |
| 95-th percentile | 960.8067932 |
| Maximum | 22638.48047 |
| Range | 22637.49047 |
| Interquartile range (IQR) | 205.8560047 |
Descriptive statistics
| Standard deviation | 630.8734131 |
|---|---|
| Coefficient of variation (CV) | 2.656281348 |
| Kurtosis | 316.3005676 |
| Mean | 237.502482 |
| Median Absolute Deviation (MAD) | 50.15200043 |
| Skewness | 13.28466034 |
| Sum | 2130397.264 |
| Variance | 398001.2812 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12.96000004 | 56 | 0.6% |
| 15.55200005 | 39 | 0.4% |
| 19.44000053 | 39 | 0.4% |
| 25.92000008 | 36 | 0.4% |
| 10.36800003 | 36 | 0.4% |
| 32.40000153 | 28 | 0.3% |
| 17.94000053 | 21 | 0.2% |
| 6.480000019 | 20 | 0.2% |
| 20.73600006 | 19 | 0.2% |
| 14.93999958 | 17 | 0.2% |
| Other values (5140) | 8659 |
| Value | Count | Frequency (%) |
| 0.9900000095 | 1 | < 0.1% |
| 1.24000001 | 1 | < 0.1% |
| 1.343999982 | 3 | |
| 1.407999992 | 1 | < 0.1% |
| 1.440000057 | 1 | < 0.1% |
| 1.447999954 | 1 | < 0.1% |
| 1.503999949 | 1 | < 0.1% |
| 1.583999991 | 1 | < 0.1% |
| 1.631999969 | 1 | < 0.1% |
| 1.639999986 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 22638.48047 | 1 | |
| 17499.94922 | 1 | |
| 13999.95996 | 1 | |
| 11199.96777 | 1 | |
| 10499.96973 | 1 | |
| 9449.950195 | 1 | |
| 9099.929688 | 1 | |
| 8749.950195 | 1 | |
| 8399.975586 | 1 | |
| 8187.649902 | 1 |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.63690078 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.971940128 |
|---|---|
| Coefficient of variation (CV) | 0.5422034439 |
| Kurtosis | 0.1128219995 |
| Mean | 3.63690078 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.8725743112 |
| Sum | 32623 |
| Variance | 3.88854787 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 2218 | |
| 2 | 2207 | |
| 5 | 1111 | |
| 4 | 1086 | |
| 1 | 827 | 9.2% |
| 7 | 545 | 6.1% |
| 6 | 514 | 5.7% |
| 9 | 234 | 2.6% |
| 8 | 228 | 2.5% |
| Value | Count | Frequency (%) |
| 1 | 827 | 9.2% |
| 2 | 2207 | |
| 3 | 2218 | |
| 4 | 1086 | |
| 5 | 1111 | |
| 6 | 514 | 5.7% |
| 7 | 545 | 6.1% |
| 8 | 228 | 2.5% |
| 9 | 234 | 2.6% |
| Value | Count | Frequency (%) |
| 9 | 234 | 2.6% |
| 8 | 228 | 2.5% |
| 7 | 545 | 6.1% |
| 6 | 514 | 5.7% |
| 5 | 1111 | |
| 4 | 1086 | |
| 3 | 2218 | |
| 2 | 2207 | |
| 1 | 827 | 9.2% |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1034860663 |
| Minimum | 0 |
|---|---|
| Maximum | 0.5 |
| Zeros | 4712 |
| Zeros (%) | 52.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0.200000003 |
| 95-th percentile | 0.3000000119 |
| Maximum | 0.5 |
| Range | 0.5 |
| Interquartile range (IQR) | 0.200000003 |
Descriptive statistics
| Standard deviation | 0.1171929389 |
|---|---|
| Coefficient of variation (CV) | 1.132451383 |
| Kurtosis | -0.2922693789 |
| Mean | 0.1034860663 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.6763803959 |
| Sum | 928.2700147 |
| Variance | 0.01373418421 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4712 | |
| 0.200000003 | 3592 | |
| 0.3000000119 | 224 | 2.5% |
| 0.400000006 | 200 | 2.2% |
| 0.1000000015 | 89 | 1.0% |
| 0.5 | 66 | 0.7% |
| 0.150000006 | 50 | 0.6% |
| 0.3199999928 | 26 | 0.3% |
| 0.4499999881 | 11 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4712 | |
| 0.1000000015 | 89 | 1.0% |
| 0.150000006 | 50 | 0.6% |
| 0.200000003 | 3592 | |
| 0.3000000119 | 224 | 2.5% |
| 0.3199999928 | 26 | 0.3% |
| 0.400000006 | 200 | 2.2% |
| 0.4499999881 | 11 | 0.1% |
| 0.5 | 66 | 0.7% |
| Value | Count | Frequency (%) |
| 0.5 | 66 | 0.7% |
| 0.4499999881 | 11 | 0.1% |
| 0.400000006 | 200 | 2.2% |
| 0.3199999928 | 26 | 0.3% |
| 0.3000000119 | 224 | 2.5% |
| 0.200000003 | 3592 | |
| 0.150000006 | 50 | 0.6% |
| 0.1000000015 | 89 | 1.0% |
| 0 | 4712 |
| Distinct | 6372 |
|---|---|
| Distinct (%) | 71.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.43068337 |
| Minimum | -3839.990479 |
|---|---|
| Maximum | 8399.975586 |
| Zeros | 64 |
| Zeros (%) | 0.7% |
| Negative | 1001 |
| Negative (%) | 11.2% |
| Memory size | 35.2 KiB |
Quantile statistics
| Minimum | -3839.990479 |
|---|---|
| 5-th percentile | -31.75479994 |
| Q1 | 3.235199928 |
| median | 9.920800209 |
| Q3 | 32.65540028 |
| 95-th percentile | 176.6062248 |
| Maximum | 8399.975586 |
| Range | 12239.96606 |
| Interquartile range (IQR) | 29.42020035 |
Descriptive statistics
| Standard deviation | 208.543335 |
|---|---|
| Coefficient of variation (CV) | 5.426480007 |
| Kurtosis | 523.6069336 |
| Mean | 38.43068337 |
| Median Absolute Deviation (MAD) | 9.044400215 |
| Skewness | 16.22002792 |
| Sum | 344723.2298 |
| Variance | 43490.32031 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 64 | 0.7% |
| 6.220799923 | 43 | 0.5% |
| 9.331199646 | 38 | 0.4% |
| 5.443200111 | 32 | 0.4% |
| 3.628799915 | 32 | 0.4% |
| 15.55200005 | 26 | 0.3% |
| 12.44159985 | 21 | 0.2% |
| 7.257599831 | 19 | 0.2% |
| 3.110399961 | 18 | 0.2% |
| 9.07199955 | 11 | 0.1% |
| Other values (6362) | 8666 |
| Value | Count | Frequency (%) |
| -3839.990479 | 1 | |
| -1811.078369 | 1 | |
| -1665.052246 | 1 | |
| -1359.991943 | 1 | |
| -1049.340576 | 1 | |
| -1002.78363 | 1 | |
| -968.8833008 | 1 | |
| -944.9946289 | 1 | |
| -814.4832153 | 1 | |
| -786.0144043 | 1 |
| Value | Count | Frequency (%) |
| 8399.975586 | 1 | |
| 6719.980957 | 1 | |
| 5039.98584 | 1 | |
| 4630.475586 | 1 | |
| 3919.98877 | 1 | |
| 3177.475098 | 1 | |
| 2799.983887 | 1 | |
| 2591.956787 | 1 | |
| 2504.22168 | 1 | |
| 2400.96582 | 1 |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | RowID | OrderID | OrderDate | ShipDate | ShipMode | CustomerID | CustomerName | Segment | Country | City | State | PostalCode | Region | ProductID | Category | SubCategory | ProductName | Sales | Quantity | Discount | Profit | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1 | CA-2016-152156 | 11/8/2016 | 11/11/2016 | Second Class | CG-12520 | Claire Gute | Consumer | United States | Henderson | Kentucky | 42420 | South | FUR-BO-10001798 | Furniture | Bookcases | Bush Somerset Collection Bookcase | 261.959991 | 2 | 0.00 | 41.913601 |
| 1 | 1 | 2 | CA-2016-152156 | 11/8/2016 | 11/11/2016 | Second Class | CG-12520 | Claire Gute | Consumer | United States | Henderson | Kentucky | 42420 | South | FUR-CH-10000454 | Furniture | Chairs | Hon Deluxe Fabric Upholstered Stacking Chairs, Rounded Back | 731.940002 | 3 | 0.00 | 219.582001 |
| 2 | 2 | 3 | CA-2016-138688 | 6/12/2016 | 6/16/2016 | Second Class | DV-13045 | Darrin Van Huff | Corporate | United States | Los Angeles | California | 90036 | West | OFF-LA-10000240 | Office Supplies | Labels | Self-Adhesive Address Labels for Typewriters by Universal | 14.620000 | 2 | 0.00 | 6.871400 |
| 3 | 3 | 4 | US-2015-108966 | 10/11/2015 | 10/18/2015 | Standard Class | SO-20335 | Sean O'Donnell | Consumer | United States | Fort Lauderdale | Florida | 33311 | South | FUR-TA-10000577 | Furniture | Tables | Bretford CR4500 Series Slim Rectangular Table | 957.577515 | 5 | 0.45 | -383.031006 |
| 4 | 4 | 5 | US-2015-108966 | 10/11/2015 | 10/18/2015 | Standard Class | SO-20335 | Sean O'Donnell | Consumer | United States | Fort Lauderdale | Florida | 33311 | South | OFF-ST-10000760 | Office Supplies | Storage | Eldon Fold 'N Roll Cart System | 22.368000 | 2 | 0.20 | 2.516400 |
| 5 | 5 | 6 | CA-2014-115812 | 6/9/2014 | 6/14/2014 | Standard Class | BH-11710 | Brosina Hoffman | Consumer | United States | Los Angeles | California | 90032 | West | FUR-FU-10001487 | Furniture | Furnishings | Eldon Expressions Wood and Plastic Desk Accessories, Cherry Wood | 48.860001 | 7 | 0.00 | 14.169400 |
| 6 | 6 | 7 | CA-2014-115812 | 6/9/2014 | 6/14/2014 | Standard Class | BH-11710 | Brosina Hoffman | Consumer | United States | Los Angeles | California | 90032 | West | OFF-AR-10002833 | Office Supplies | Art | Newell 322 | 7.280000 | 4 | 0.00 | 1.965600 |
| 7 | 7 | 8 | CA-2014-115812 | 6/9/2014 | 6/14/2014 | Standard Class | BH-11710 | Brosina Hoffman | Consumer | United States | Los Angeles | California | 90032 | West | TEC-PH-10002275 | Technology | Phones | Mitel 5320 IP Phone VoIP phone | 907.151978 | 6 | 0.20 | 90.715202 |
| 8 | 8 | 9 | CA-2014-115812 | 6/9/2014 | 6/14/2014 | Standard Class | BH-11710 | Brosina Hoffman | Consumer | United States | Los Angeles | California | 90032 | West | OFF-BI-10003910 | Office Supplies | Binders | DXL Angle-View Binders with Locking Rings by Samsill | 18.504000 | 3 | 0.20 | 5.782500 |
| 9 | 9 | 10 | CA-2014-115812 | 6/9/2014 | 6/14/2014 | Standard Class | BH-11710 | Brosina Hoffman | Consumer | United States | Los Angeles | California | 90032 | West | OFF-AP-10002892 | Office Supplies | Appliances | Belkin F5C206VTEL 6 Outlet Surge | 114.900002 | 5 | 0.00 | 34.470001 |
Last rows
| df_index | RowID | OrderID | OrderDate | ShipDate | ShipMode | CustomerID | CustomerName | Segment | Country | City | State | PostalCode | Region | ProductID | Category | SubCategory | ProductName | Sales | Quantity | Discount | Profit | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8960 | 9983 | 9984 | US-2016-157728 | 9/22/2016 | 9/28/2016 | Standard Class | RC-19960 | Ryan Crowe | Consumer | United States | Grand Rapids | Michigan | 49505 | Central | TEC-PH-10001305 | Technology | Phones | Panasonic KX TS208W Corded phone | 97.980003 | 2 | 0.0 | 27.434401 |
| 8961 | 9985 | 9986 | CA-2015-100251 | 5/17/2015 | 5/23/2015 | Standard Class | DV-13465 | Dianna Vittorini | Consumer | United States | Long Beach | New York | 11561 | East | OFF-SU-10000898 | Office Supplies | Supplies | Acme Hot Forged Carbon Steel Scissors with Nickel-Plated Handles, 3 7/8" Cut, 8"L | 55.599998 | 4 | 0.0 | 16.124001 |
| 8962 | 9986 | 9987 | CA-2016-125794 | 9/29/2016 | 10/3/2016 | Standard Class | ML-17410 | Maris LaWare | Consumer | United States | Los Angeles | California | 90008 | West | TEC-AC-10003399 | Technology | Accessories | Memorex Mini Travel Drive 64 GB USB 2.0 Flash Drive | 36.240002 | 1 | 0.0 | 15.220800 |
| 8963 | 9987 | 9988 | CA-2017-163629 | 11/17/2017 | 11/21/2017 | Standard Class | RA-19885 | Ruben Ausman | Corporate | United States | Athens | Georgia | 30605 | South | TEC-AC-10001539 | Technology | Accessories | Logitech G430 Surround Sound Gaming Headset with Dolby 7.1 Technology | 79.989998 | 1 | 0.0 | 28.796400 |
| 8964 | 9988 | 9989 | CA-2017-163629 | 11/17/2017 | 11/21/2017 | Standard Class | RA-19885 | Ruben Ausman | Corporate | United States | Athens | Georgia | 30605 | South | TEC-PH-10004006 | Technology | Phones | Panasonic KX - TS880B Telephone | 206.100006 | 5 | 0.0 | 55.646999 |
| 8965 | 9989 | 9990 | CA-2014-110422 | 1/21/2014 | 1/23/2014 | Second Class | TB-21400 | Tom Boeckenhauer | Consumer | United States | Miami | Florida | 33180 | South | FUR-FU-10001889 | Furniture | Furnishings | Ultra Door Pull Handle | 25.247999 | 3 | 0.2 | 4.102800 |
| 8966 | 9990 | 9991 | CA-2017-121258 | 2/26/2017 | 3/3/2017 | Standard Class | DB-13060 | Dave Brooks | Consumer | United States | Costa Mesa | California | 92627 | West | FUR-FU-10000747 | Furniture | Furnishings | Tenex B1-RE Series Chair Mats for Low Pile Carpets | 91.959999 | 2 | 0.0 | 15.633200 |
| 8967 | 9991 | 9992 | CA-2017-121258 | 2/26/2017 | 3/3/2017 | Standard Class | DB-13060 | Dave Brooks | Consumer | United States | Costa Mesa | California | 92627 | West | TEC-PH-10003645 | Technology | Phones | Aastra 57i VoIP phone | 258.575989 | 2 | 0.2 | 19.393200 |
| 8968 | 9992 | 9993 | CA-2017-121258 | 2/26/2017 | 3/3/2017 | Standard Class | DB-13060 | Dave Brooks | Consumer | United States | Costa Mesa | California | 92627 | West | OFF-PA-10004041 | Office Supplies | Paper | It's Hot Message Books with Stickers, 2 3/4" x 5" | 29.600000 | 4 | 0.0 | 13.320000 |
| 8969 | 9993 | 9994 | CA-2017-119914 | 5/4/2017 | 5/9/2017 | Second Class | CC-12220 | Chris Cortes | Consumer | United States | Westminster | California | 92683 | West | OFF-AP-10002684 | Office Supplies | Appliances | Acco 7-Outlet Masterpiece Power Center, Wihtout Fax/Phone Line Protection | 243.160004 | 2 | 0.0 | 72.947998 |